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ABSTRACT 

We analyze the computational complexity of market maker 
pricing algorithms for combinatorial prediction markets. We 
focus on Hanson's popular logarithmic market scoring rule 
market maker (LMSR). Our goal is to implicitly main- 
tain correct LMSR prices across an exponentially large out- 
come space. We examine both permutation combinatorics, 
where outcomes are permutations of objects, and Boolean 
combinatorics, where outcomes are combinations of binary 
events. We look at three restrictive languages that limit 
what traders can bet on. Even with severely limited lan- 
guages, we find that LMSR pricing is #P-hard, even when 
the same language admits polynomial-time matching with- 
out the market maker. We then propose an approximation 
technique for pricing permutation markets based on a recent 
algorithm for online permutation learning. The connections 
we draw between LMSR pricing and the vast literature on 
online learning with expert advice may be of independent 
interest. 
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1. INTRODUCTION 

One way to elicit information is to ask people to bet on it. 
A prediction market is a common forum where people bet 
with each other or with a market maker [5J 1101 1231 I2UI I21| . 
A typical binary prediction market allows bets along one di- 
mension, for example either for or against Hillary Clinton to 
win the 2008 US Presidential election. Thousands of such 
one- or small-dimensional markets exist today, each operat- 
ing independently. For example, at the racetrack, betting 
on a horse to win does not directly impact the odds for that 
horse to finish among the top two, as logically it should, 
because the two bet types are handled separately. 

A combinatorial prediction market is a central clearinghouse 
for handling logically-related bets defined on a combinato- 
rial space. For example, the outcome space might be all 
n! possible permutations of n horses in a horse race, while 
bets are properties of permutations such as "horse A finishes 
3rd" or "horse A beats horse B." Alternately, the outcome 
space might be all 2 50 possible state-by-state results for the 
Democratic candidate in the 2008 US Presidential election, 
while bets are Boolean statements such as "Democrat wins 
in Ohio and Florida but not in Texas." 

Low liquidity marginalizes the value of prediction markets, 
and combinatorics only exacerbates the problem by divid- 
ing traders' attention among an exponential number of out- 
comes. A combinatorial matching market — the combinato- 
rial generalization of a standard double auction — may sim- 
ply fail to find any trades [111 |4j [5] . 



In contrast, an automated market maker is always willing to 
trade on every bet at some price. A combinatorial market 
maker implicitly or explicitly maintains prices across all (ex- 
ponentially many) outcomes, thus allowing any trader at any 
time to place any bet, if transacted at the market maker's 
quoted price. 

Hanson's [131 [14] logarithmic market scoring rule market 
maker (LMSR) is becoming the de facto standard market 
maker for prediction markets. LMSR has a number of desir- 
able properties, including bounded loss that grows logarith- 
mically in the number of outcomes, infinite liquidity, and 



modularity that respects some independence relationships. 
LMSR is used by a number of companies, including inkling- 
markets. com, Microsoft, thewsx.com, and yoonew.com, and 
is the subject of a number of research studies [7] 1151 [8] . 

In this paper, we analyze the computational complexity of 
LMSR in several combinatorial betting scenarios. We exam- 
ine both permutation combinatorics and Boolean combina- 
torics. We show that both computing instantaneous prices 
and computing payments of transactions are #P-hard in all 
cases we examine, even when we restrict participants to very 
simplistic and limited types of bets. For example, in the 
horse race analogy, if participants can place bets only of 
the form "horse A finishes in position N", then pricing these 
bets properly according to LMSR is #P-hard, even though 
matching up bets of the exact same form (with no market 
maker) is polynomial [4]. 

On a more positive note, we examine an approximation 
algorithm for LMSR pricing in permutation markets that 
makes use of powerful techniques from the literature on on- 
line learning with expert advice [3l ll9llT2] . We briefly review 
this online learning setting, and examine the parallels that 
exist between LMSR pricing and standard algorithms for 
learning with expert advice. We then show how a recent 
algorithm for permutation learning [16] can be transformed 
into an approximation algorithm for pricing in permutation 
markets in which the market maker is guaranteed to have 
bounded loss. 

2. RELATED WORK 

Fortnow et al. [11] study the computational complexity of 
finding acceptable trades among a set of bids in a Boolean 
combinatorial market. In their setting, the center is an auc- 
tioneer who takes no risk, only matching together willing 
traders. They study a call market setting in which bids are 
collected together and processed once en masse. They show 
that the auctioneer matching problem is co-NP-complete 
when orders are divisible and Ej-complete when orders are 
indivisible, but identify a tractable special case in which 
participants are restricted to bet on disjunctions of positive 
events or single negative events. 

Chen et al. [4] analyze the the auctioneer matching prob- 
lem for betting on permutations, examining two bidding 
languages. Subset bets are bets of the form "candidate i 
finishes in positions x, y, or z" or "candidate i, j, or k fin- 
ishes in position x." Pair bets are of the form "candidate i 
beats candidate j." They give a polynomial-time algorithm 
for matching divisible subset bets, but show that matching 
pair bets is NP-hard. 

Hanson highlights the use of LMSR for Boolean combinato- 
rial markets, noting that the subsidy required to run a com- 
binatorial market on 2™ outcomes is no greater than that 
required to run n independent one-dimensional markets [131 
114] , Hanson discusses the computational difficulty of main- 
taining LMSR prices on a combinatorial space, and proposes 
some solutions, including running market makers on overlap- 
ping subsets of events, allowing traders to synchronize the 
markets via arbitrage. 

The work closest to our own is that of Chen, Goel, and Pen- 



nock [6] , who study a special case of Boolean combinatorics 
in which participants bet on how far a team will advance 
in a single elimination tournament, for example a sports 
playoff like the NCAA college basketball tournament. They 
provide a polynomial-time algorithm for LMSR pricing in 
this setting based on a Bayesian network representation of 
prices. They also show that LMSR pricing is NP-hard for a 
very general bidding language. They suggest an approxima- 
tion scheme based on Monte Carlo simulation or importance 
sampling. 

We believe ours are the first non-trivial hardness results and 
worst-case bounded approximation scheme for LMSR pric- 
ing. 

3. BACKGROUND 

3.1 Logarithmic Market Scoring Rules 

Proposed by Hanson [131 114] , a logarithmic market scoring 
rule is an automated market maker mechanism that always 
maintains a consistent probability distribution over an out- 
come space fl reflecting the market's estimate of the like- 
lihood of each outcome. A generic LMSR offers a security 
corresponding to each possible outcome ui. The security as- 
sociated to outcome u) pays off $1 if the outcome ui happens, 
and $0 otherwise. Let q = (qu)uen indicate the number 
of outstanding shares for all securities. The LMSR market 
maker starts the market with some initial shares of secu- 
rities, q° , which may be 0. The market keeps track of the 
outstanding shares of securities q at all times, and maintains 
a cost function 

C(q)=Mog^e^ /6 , (1) 

and an instantaneous price function for each security 

e <Wf> 

P"(«) = v — ^A' (2) 

where b is a positive parameter related to the depth of the 
market. The cost function captures the total money wa- 
gered in the market, and C(q°) reflects the market maker's 
maximum subsidy to the market. The instantaneous price 
function p^(q) gives the current cost of buying an infinitely 
small quantity of the security for outcome uj, and is the par- 
tial derivative of the cost function, i.e. p u (q) = dC(q)/dq LJ . 
We use p — (pui{q))uien to denote the price vector. Traders 
buy and sell securities through the market maker. If a trader 
wishes to change the number of outstanding shares from 
q to q, the cost of the transaction that the trader pays is 
C(q) — C(q), which equals the integral of the price functions 
following any path from q to q. 

When the outcome space is large, it is often natural to offer 
only compound securities on sets of outcomes. A compound 
security S pays $1 if one of the outcomes in the set S C 
occurs and $0 otherwise. Such a security is the combination 
of all securities u) S S. Buying or selling q shares of the 
compound security S is equivalent to buying or selling q 
shares of each security u 6 S. Let O denote the set of 
all allowable compound securities. Denote the outstanding 
shares of all compound securities as Q = (qs)see- The cost 



function can be written as 

C(Q) = blo g J2e Esee: " esqs/b 
wen 

= &io g ]T n e " s/b ■ ( 3 ) 

usf2 see-.ues 

The instantaneous price of a compound security S is com- 
puted as the sum of the instantaneous prices of the securities 
that compose the compound security S, 

E u6S eEs ' ee: " es ' qs ' /b 

V TT e q s'/ b 
~ V TT e q s'/ b ' 1 j 

Logarithmic market scoring rules are so named because they 
are based on logarithmic scoring rules. A logarithmic scoring 
rule is a set of reward functions 

{suj(r) = + 61og(r w ) : lu G Q}, 

where r — (r u ) u6 n is a probability distribution over f2, and 
a„ is a free parameter. An agent who reports r is rewarded 
s„(r) if outcome ui happens. Logarithmic scoring rules are 
proper in the sense that when facing them a risk-neutral 
agent will truthfully report his subjective probability distri- 
bution to maximize his expected reward. A LMSR market 
can be viewed as a sequential version of logarithmic scor- 
ing rule, because by changing market prices from p to p a 
trader's net profit is ^(p) — s^(p) when outcome uj hap- 
pens. At any time, a trader in a LMSR market is essentially 
facing a logarithmic scoring rule. 

LMSR markets have many desirable properties. They of- 
fer consistent pricing for combinatorial events. As market 
maker mechanisms, they provide infinite liquidity by allow- 
ing trades at any time. Although the market maker sub- 
sidizes the market, he is guaranteed a worst-case loss no 
greater than C(q°), which is blogn if |£7| = n and the mar- 
ket starts with share of every security. In addition, it 
is a dominant strategy for a myopic risk-neutral trader to 
reveal his probability distribution truthfully since he faces 
a proper scoring rule. Even for forward-looking traders, 
truthful reporting is an equilibrium strategy when traders' 
private information is independent conditional on the true 
outcome [7]. 

3.2 Complexity of Counting 

The well-known class NP contains questions that ask 
whether a search problem has a solution, such as whether a 
graph is 3-colorable. The class #P consists of functions that 
count the number of solutions of NP search questions, such 
as the number of 3-colorings of a graph. 

A function g is #P-hard if, for every function / in #P, it 
is possible to compute / in polynomial time given an ora- 
cle for g. Clearly oracle access to such a function g could 
additionally be used to solve any NP problem, but in fact 
one can solve much harder problems too. Toda [24] showed 



that every language in the polynomial-time hierarchy can be 
solved efficiently with access to a #P-hard function. 

To show a function g is a #P-hard function, it is sufficient to 
show that a function / reduces to g where / was previously 
known to be #P-hard. In this paper we use the following 
#P-hard functions to reduce from: 

• Permanent: The permanent of an n-by-n matrix A = 
(ai.j) is defined as 

n 

perm(A) = II ai . CT « ' ( 5 ) 

crfci! 1 = 1 

where f2 is the set of all permutations over {1, 2, n}. 
Computing the permanent of a matrix A containing 0- 
1 entries is #P-hard [25] . 

• #2-SAT: Counting the number of satisfying assign- 
ments of a formula given in conjunctive normal form 
with each clause having two literals is #P-hard |26] . 

• Counting Linear Extensions: Counting the num- 
ber of total orders that extend a partial order given by 
a directed graph is #P-hard [2]. 

#P-hardness is the best we can achieve since all the func- 
tions in this paper can themselves be reduced to some other 
#P function. 

4. LMSR FOR PERMUTATION BETTING 

In this section we consider a particular type of market com- 
binatorics in which the final outcome is a ranking over 
n competing candidates. Let the set of candidates be 
Nn = {l,...,n}, which is also used to represent the set 
of positions. In the setting, Q is the set of all permutations 
over J\f n . An outcome a £ f2 is interpreted as the scenario 
in which each candidate i ends up in position a(i). Chen et 
al. [4] propose two betting languages, subset betting and pair 
betting, for this type of combinatorics and analyze the com- 
plexity of the auctioneer's order matching problem for each. 
In what follows we address the complexity of operating an 
LMSR market for both betting languages. 

4.1 Subset Betting 

As in Chen et al. [4], participants in a LMSR market for 
subset betting may trade two types of compound securities: 
(1) a security of the form where $ C AT n is a subset of 
positions; and (2) a security where C M n is a subset 

of candidates. The security (i\&) pays off $1 if candidate i 
stands at a position that is an element of $ and $0 other- 
wise. Similarly, the security {^\j) pays off $1 if any of the 
candidates in finishes at position j and $0 otherwise. For 
example, in a horse race, participants can trade securities of 
the form "horse A will come in the second, fourth, or fifth 
place", or "either horse B or horse C will come in the third 
place". 

Note that owning one share of (i\&) is equivalent to owning 
one share of for every j G and similarly owning one 
share of {^\j) is equivalent to owing one share of (i\j) for 
every i £ We restrict our attention to a simplified market 



where securities traded are of the form We show that 

even in this simplified market it is #P-hard for the market 
maker to provide the instantaneous security prices, evaluate 
the cost function, or calculate payments for transactions, 
which implies that the running an LMSR market for the 
more general case of subset betting is also #P-hard. 

Traders can trade securities for all i £ N n and j £ 

M n with the market maker. Let qij be the total number 
of outstanding shares for security in the market. Let 
Q — {li,j)ieAf n ,jeAf n denote the outstanding shares for all 
securities. The market maker keeps track of Q at all times. 
From Equation [4] the instantaneous price of security is 



i(Q) = 



E 



o-£f2:<r(i)=j 1 lfc = l 



n; 



(6) 



ErSfi llfe= 1 

and from Equation the cost function for subset betting is 



C(Q) = b\ogJ2U e 



9k,t,(kf/ b 



(7) 



o-efi fe=i 



We will show that computing instantaneous prices, the cost 
function, and/or payments of transactions for a subset bet- 
ting market is #P-hard by a reduction from the problem of 
computing the permanent of a (0,l)-matrix. 

Theorem 1. It is #P-hard to compute instantaneous 
prices in a LMSR market for subset betting. Additionally, it 
is ^P-hard to compute the value of the cost function. 

Proof. We show that if we could compute the instan- 
taneous prices or the value of the cost function for subset 
betting for any quantities of shares purchased, then we could 
compute the permanent of any (0, l)-matrix in polynomial 
time. 

Let n be the number of candidates, A = (dij) be any n-by- 
n (0,l)-matrix, and N = n! + 1. Note that fJ2=i a i.<y{i) ls 
either or 1. From Equation [5] perm(A) < n! and hence 
perm(A) mod N = perm(A). We show how to compute 
perm(A) mod N from prices in subset betting markets in 
which qij shares of have been purchased, where g ivJ - is 
defined by 



if dij = 0, 



blniV 

b\n(N+l) ifaij 
for any i 6 J\f n and any j £ J\f n ■ 



(8) 



Let B = (bij) be a n-by-n matrix containing entries of the 
form bij = e qi ' j/b . Note that bij = N if atj = and 
bij = JV + 1 if ai.j = 1. Thus, perm(A) mod N = perm(B) 
mod N. Thus, from Equation [6] the price for in the 
market is 



Pi,j(Q) = 



E CT gn:o-(i)=j life=l ^ 



fe,(T(fe) 



Er£f2 Hk=l b k,r(k) 

Etgo rifc=i b k,r(k) 

• perm(M 4j ) 
perm(B) 



where Mij is the matrix obtained from B by removing the 
iih row and j'th column. Thus the ability to efficiently 
compute prices gives us the ability to efficiently compute 
perm(A/i i j)/perm(_B). 

It remains to show that we can use this ability to compute 
perm(_B). We do so by telescoping a sequence of prices. 
Let Bi be the matrix B with the first i rows and columns 
removed. From above, we have perm(_Bi)/perm(_B) — 
pi,i(Q)/6i,i. Define Q m to be the (n — m)-by-(n— m) matrix 
{qi,j)i>m,j>m,, that is, the matrix of quantities of securities 
(qi.j) with the first k rows and columns removed. In a mar- 
ket with only n — m candidates, applying the same technique 
to the matrix Q m , we can obtain perm(B m +i)/perm(B m ) 
from market prices for m = 1, (n — 2). Thus by computing 
n — 1 prices, we can compute 



/ perm(Bi)\ / perm(_B 



\ perm(B) J \perm(_B 



perm(B n _ 2 ; 
perm(j? n _i' 
perm(_B) 



Noting that B n -i only has one element, we thus can com- 
pute perm(_B) from market prices. Consequently, perm(B) 
mod JV gives perm(yl). 

Therefore, given a 7i-by-n (0, l)-matrix A, we can compute 
the permanent of A in polynomial time using prices in n — 1 
subset betting markets wherein an appropriate quantity of 
securities have been purchased. 

Additionally, note that 

n 

C{Q) = blog^ Yl b k,v(k) = fclogperm(B) . 

o-gn fc=i 

Thus if we can compute C(Q), we can also compute 
perm(A). 

As computing the permanent of a (0, l)-matrix is #P-hard, 
both computing market prices and computing the cost func- 
tion in a subset betting market are #P-hard. □ 



Corollary 2. Computing the payment of a transaction 
in a LMSR for subset betting is #P-hard. 

Proof. Suppose the market maker starts the market 
with share of every security. Denote Q° as the initial 
quantities of all securities. If the market maker can compute 
C(Q) — C(Q) for any quantities Q and Q, it can compute 
C(Q) - C(Q°) for any Q. As C(Q°) = fologn!, the market 
maker is able to compute C(Q). According to Theorem [T] 
computing the payment of a transaction is #P-hard. □ 

4.2 Pair Betting 

In contrast to subset betting, where traders bet on absolute 
positions for a candidate, pair betting allows traders to bet 
on the relative position of a candidate with respect to an- 
other. More specifically, traders buy and sell securities of 
the form (i > j), where i and j are candidates. The security 
pays off $1 if candidate i ranks higher than candidate j (i.e., 
<r(i) < a(j) where a is the final ranking of candidates) and 



$0 otherwise. For example, traders may bet on events of the 
form "horse A beats horse B", or "candidate C receives more 
votes than candidate D". 

As for subset betting, the current state of the market is 
determined by the total number of outstanding shares for all 
securities. Let qij denote the number of outstanding shares 
for (i > j). Applying Equations [3] and [4] to the present 
context, we find that the instantaneous price of the security 
is given by 



1 U' .•^crfi'V"'*'' 1 



d(l:a(i]<a(j) 1 U> , j' :a (i') «r(j' ) 



ET-r q i t jb 

Ten Hi' ,j':T(i')<T(i') e 1 ' 3 

and the cost function for pair betting is 

c(Q)=Mog]r n e9 ^ /b ■ 

CT6f! i,j:cr(i)<cr(J) 



, (9) 



(10) 



We will show that computing prices, the value of the cost 
function, and/or payments of transactions for pair betting 
is #P-hard via a reduction from the problem of computing 
the number of linear extensions to any partial ordering. 



Theorem 3. It is #P-hard to compute instantaneous 
prices in a LMSR market for pair betting. Additionally, it 
is #P-hard to compute the value of the cost function. 

Proof. Let P be a partial order over {1, ...,n}. Wc 
recall that a linear (or total) order T is a linear extension of 
P if whenever x < y in P it also holds that x < y in T. We 
denote by M(P) the number of linear extensions of P. 

Recall that (i, j) is a covering pair of P if i < j in P and 
there does not exist £ 7^ i,j such that i < £ < j. Let 
{(ii, ji), (i 2 , 32), ... , (ik,jk)} be a set of covering pairs of 
P. Note that covering pairs of a partially ordered set with 
n elements can be easily obtained in polynomial time, and 
that their number is less than n 2 . 

We will show that we can design a sequence of trades that, 
given a list of covering pairs for P, provide N(P) through a 
simple function of market prices. 

We consider a pair betting market over n candidates. We 
construct a sequence of k trading periods, and denote by 
qj j and p\j respectively the outstanding quantity of security 
(i > j) and its instantaneous price at the end of period t. 
At the beginning of the market, q° j = for any i and j. At 
each period t, < t < k, 6 Inn! shares of security {it > jt) 
are purchased. 



Let 



and 



Nt(iJ)= e n 

CTGn:CT(i)<<r(j) i' ,j':a(i')<cr(j') 



q% ;/b 

e « <r 



d *=e n 

o-Gfi i',j':cr(i')<a(j') 



I 1 , ,/b 

e * <r 



Note that according to Equation [9l p\ t j t = N t (it, jt)/ Dt 



For the first period, as only the security (ii > j\) is pur- 
chased, we get 



D 1= £ n!+ £ 1 

o-Gf2:CT(i 1 )<o-(j 1 ) a-:cr(i 1 )>a(j 1 ) 



(n!) 2 + n! 



We now show that can be calculated inductively from D 1 
using successive prices given by the market. During period 
t, &lnn! shares of (i t > jt) are purchased. Note also that 
the securities purchased are different at each period, so that 
lit -it = if s < t and qt t j t = blnn! if s > t. We have 



NSuJt) = Nt^(it,jt)e bln( - n])/b = n!JV t _i(it,it) • 



Hence, 



Ph, 



Nt(it,jt)/D t _ n\D t - 



Pit,h 
and therefore, 



N t -i(it,jt)/D t - 



D k = (n! 



v fc-l / TT ri t ,n 



n 



Dt 



D 1 



So Dt can be computed in polynomial time in n from the 
prices. 

Alternately, since the cost function at the end of period k 
can be written as C(Q) = blogDk, P>k can also be computed 
efficiently from the cost function in period k. 

We finally show that given Dk, we can compute Af(P) in 
polynomial time. Note that at the end of the k trading pe- 
riods, the securities purchased correspond to the covering 
pairs of P, such that e qi -^ b — n! if (i, j) is a covering pair 
of P and e qi -^ b = 1 otherwise. Consequently, for a per- 
mutation a that satisfies the partial order P, meaning that 
a(i) < a(j) whenever i < j in P, we have 



n * 9 

i',j':<r(i')<<r(j') 



' /b = (n!) fe 



On the other hand, if a permutation a does not satisfy P, 
it does not satisfy at least one covering pair, meaning that 
there is a covering pair of P, (i,j), such that a(i) > o~(j), so 
that 



n 



q K , .,/b - / iNfc-1 

e'»'.j" < (n!) 



i',j':cr(i')<a(j') 

Since the total number of permutations is n\, the total sum 
of all terms in the sum corresponding to permutations 
that do not satisfy the partial ordering P is less than or 



equal to n!(n!) 



(n!) , and is strictly less than (r 



unless the number of linear extensions is 0, while the total 
sum of all the terms corresponding to permutations that do 
satisfy P is jV(P)(nl) k . Thus M{P) = [D k /(n\) k \ . 

We know that computing the number of linear extensions of 
a partial ordering is #P-hard. Therefore, both computing 
the prices and computing the value of the cost function in 
pair betting are #P-hard. □ 



Corollary 4. Computing the payment of a transaction 
in a LMSR for pair betting is #P-hard. 



The proof is nearly identical to the proof of Corollary [2] 

5. LMSR FOR BOOLEAN BETTING 

We now examine an alternate type of market combinatorics 
in which the final outcome is a conjunction of event out- 
comes. Formally, let A be event space, consisting of N indi- 
vidual events A\, • ■ ■ , Ajv, which may or may not be mutu- 
ally independent. We define the state space Q be the set of 
all possible joint outcomes for the N events, so that its size 
is |fi| = 2^. A Boolean betting market allows traders to bet 
on Boolean formulas of these events and their negations. A 
security {(/>) pays off $1 if the Boolean formula <f> is satisfied 
by the final outcome and $0 otherwise. For example, a secu- 
rity {Ai V A2) pays off $1 if and only if at least one of events 
Ai and A2 occurs, while a security (Ai A A3 A ^As) pays off 
$1 if and only if the events Ai and A3 both occur and the 
event A5 does not. Following the notational conventions of 
Fortnow et al. [11], we use uj £ (f> to mean that the outcome 
uj satisfies the Boolean formula (f>. Similarly, uj <f> implies 
that the outcome uj does not satisfy 4>- 

In this section, we focus our attention to LMSR markets for 
a very simple Boolean betting language, Boolean formulas of 
two events. We show that even when bets are only allowed 
to be placed on disjunctions or conjunctions of two events, it 
is still #P-hard to calculate the prices, the value of the cost 
function, and payments of transactions in a Boolean betting 
market operated by a LMSR market maker. 

Let X be the set containing all elements of A and their 
negations. In other words, each event outcome Xi G X is 
either Aj or -<Aj for some Aj £ A. We begin by consider- 
ing the scenario in which traders may only trade securities 
(XiVXj) corresponding to disjunctions of any two event 
outcomes. 

Let qij be the total number of shares purchased by all 
traders for the security {Xi VX 3 ), which pays off $1 in the 
event of any outcome uj such that uj £ {Xi V Xj) and $0 
otherwise. From Equation [4] we can calculate the instan- 
taneous price for the security {Xi V Xj) for any two event 
outcomes Xi ,Xj £ X as 



Pi,i(Q) = 



2JV:^e(X i , VXy ) 



FT q i' i' l h 

Zji-en lli<i'<j'<2jv:Te(x., vx.,) e 



(11) 



Note that if Xi = —<Xj, Pi,j{Q) is always $1 regardless of 
how many shares of other securities have been purchased. 
According to Equation [31 the cost function is 



C(Q) = blogJ2 



n 



(12) 



l< 1 <i<2fI:u6(X i VX j ) 



Theorem [5] shows that computing prices and the value of the 
cost function in such a market is #P-hard, via a reduction 
from the #2-SAT problemQ 



restricted to disjunctions of two event outcomes. Addition- 
ally, it is #P-hard to compute the value of the cost function 
in this setting. 

Proof. Suppose we are given a 2-CNF (Conjunctive Nor- 
mal Form) formula 

(X n V X h ) A (X !2 V Xj 2 ) A ■ ■ ■ A (X ik V X ]k ) (13) 

with k clauses, where each clause is a disjunction of two liter- 
als (i.e. events and their negations) . Assume any redundant 
terms have been removed. 

The structure of the proof is similar to that of the pair bet- 
ting case. We consider a Boolean betting markets with JV 
events, and show how to construct a sequence of trades that 
provides, through prices or the value of the cost function, the 
number of satisfiable assignments for the 2-CNF formula. 

We create k trading periods. At period t, a quantity 6 ln(2 JV ) 
of the security {Xi t V Xj t ) is purchased. We denote by pjj 
and qj j respectively the price and outstanding quantities 
of the security {Xi V Xj) at the end of period t. Suppose 
the market starts with share of every security. Note that 
qt ujt = if s < t and q! tJt = bln{2 N ) ]£s>t. Let 



e n 

ui£n:u£(XiVXj) l<i'<j'<2JV:ue(X i / VX ., ) 

and 



Q*, ;/b 

e * .j 



*=z n 

wen X<i'<j' <2N:u/ £ (X i / VX -/ ) 



3-/ ;/b 

e » •! 



Thus, P ij=N t (i t ,jt)/D t . 

Since only one security {Xi 1 V Xj 1 ) has been purchased in 
period 1, we get 



Di 



E 2 "+ E 1 



3-2 2N - 2 + 2 N ~ 2 . 



We then show that Dk can be calculated inductively from 
D\. As the only security purchased in period t is (X; t V Xj t ) 
in quantity 6^(2^), we obtain 

N t (i t ,j t ) = JV t _i(it, j t )e bh ^ N)/b = N t ^(i t ,j t )2 N . 
Therefore, 



Pit ,3 



N t {it,j t )/D t _ 2 N D t - 



Nt-i{it,jt)/D t - 



and we get 



Theorem 5. It is ^V-hard to compute instantaneous 
prices in a LMSR market for Boolean betting when bets are 

1 This can also be proved via a reduction from counting linear 
extensions using a similar technique to the proof of Theo- 
rem [3j but the reduction to #2-SAT is more natural. 



In addition, since the cost function at the end of period k 
can be expressed as 

C(Q) =b log D k , 

Dk can also be computed efficiently from the cost function 
in period k. 



We now show that we can deduce from Dk the number of 
satisfiable assignments for the 2-CNF formula (Equation ll3|) . 
Indeed, each term in the sum 

E II e^' /6 

that corresponds to an outcome lj that satisfies the formula 
is exactly 2 kN , as exactly k terms in the product are 2 N 
and the rest is 1. On the contrary, each term in the sum 
that corresponds to an outcome u that does not satisfy the 
2-CNF formula will be at most 2 (fc ~ 1)JV since at most k - 1 
terms in the product will be 2^ and the rest will be 1. Since 
the total number of outcomes is 2^, the total sum of all 
terms corresponding to outcomes that do not satisfy (|13[1 is 
less than or equal to 2 N (2 {k ~ 1)N ) = 2 kN , and is strictly less 
than 2 kN unless the number of satisfying assignments is 0. 
Thus the number of satisfying assignments is \jDk/2 kN \. 

We know that computing the number of satisfiable assign- 
ments of a 2-CNF formula is #P-hard. We have shown 
how to compute it in polynomial time using prices or the 
value of the cost function in a Boolean betting market of 
N events. Therefore, both computing prices and computing 
the value of the cost function in a Boolean betting market 
is #P-hard. □ 

COROLLARY 6. Computing the payment of a transaction 
in a LMSR for Boolean betting is #P-hard when traders can 
only bet on disjunctions of two events. 

The proof is nearly identical to the proof of Corollary [2] 

If we impose that participants in a Boolean betting market 
may only trade securities corresponding to conjunctions of 
any two event outcomes, {Ai A Aj), the following Corollary 
gives the complexity results for this situation. 

Corollary 7. It is #P-hard to compute instantaneous 
prices in a LMSR market for Boolean betting when bets are 
restricted to conjunctions of two event outcomes. Addition- 
ally, it is ^P-hard to compute the value of the cost function 
in this setting, and ^P-hard to compute the payment for a 
transaction. 

Proof. Buying q shares of security {Ai A Aj) is equiv- 
alent to selling q shares of (->Aj V ->Aj). Thus if we can 
operate a Boolean betting market for securities of the type 
{Ai A Aj) in polynomial time, we can also operate a Boolean 
betting market for securities of the type {Ai V Aj) in poly- 
nomial time. The result then follows from Theorem [S] and 
Corollary El □ 

6. AN APPROXIMATION ALGORITHM 
FOR SUBSET BETTING 

There is an interesting relationship between logarithmic 
market scoring rule market makers and a common class of 
algorithms for online learning in an experts setting. In this 
section, we elaborate on this connection, and show how re- 
sults from the online learning community can be used to 
prove new results about an approximation algorithm for sub- 
set betting. 



6.1 The Experts Setting 

We begin by describing the standard model of online learn- 
ing with expert advice |19l 1121 12"7] . In this model, at each 
time t £ {1, • • • , T}, each expert i £ {1, • • • , n} receives a 
loss £i : t € [0,1]- The cumulative loss of expert i at time 
T is d.T = Ylt=i No statistical assumptions are made 
about these losses, and in general, algorithms are expected 
to perform well even if the sequence of losses is chosen by 
an adversary. 

An algorithm A maintains a current weight Wi,t for each ex- 
pert i, where ET=i Wi > t = These weights can be viewed as 
distributions over the experts. The algorithm then receives 
its own instantaneous loss = ~^2™ =1 Wi :t £i,t, which may 
be interpreted as the expected loss of the algorithm when 
choosing an expert according to the current distribution. 
The cumulative loss of A up to time T is then defined in 

the natural way as Ca.t = Yh=i ^A,t = Et=i Z)"=i w i,t^i,t- 
A common goal in such online learning settings is to mini- 
mize an algorithm's regret. Here the regret is defined as the 
difference between the cumulative loss of the algorithm and 
the cumulative loss of an algorithm that would have "cho- 
sen" the best expert in hindsight by setting his weight to 1 
throughout all the periods. Formally, the regret is given by 
Ca.t - min 4G{lj ... >n } C i>T . 

Many algorithms that have been analyzed in the online 
experts setting are based on exponential weight updates. 
These exponential updates allow the algorithm to quickly 
transfer weight to an expert that is outperforming the oth- 
ers. For example, in the Weighted Majority algorithm of 
Littlestone and Warmuth [19], the weight on each expert i 
is defined as 

Wi.t-ie-*'.' e-" £ '-' 
^ E"=i t^t-ie-*.* ~ E* =1 e-" £ i.« ' [ ' 

where 77 is the learning rate, a small positive parameter that 
controls the magnitude of the updates. The following theo- 
rem gives a bound on the regret of Weighted Majority. For 
a proof of this result and a nice overview of learning with 
expert advice, see, for example, Cesa-Bianchi and Lugosi [3]. 

Theorem 8. Let A be the Weighted Majority algorithm 
with parameter r\. After a sequence of T trials, 

Ca.t - min U.t < rfT + . 
ie{i, •••,«} 77 

6.2 Relationship to LMSR Markets 

There is a manifest similarity between the expert weights 
used by Weighted Majority and the prices in the LMSR 
market. One might ask if the results from the experts setting 
can be applied to the analysis of prediction markets. Our 
answer is yes. In fact, it is possible to use Theorem [8] to 
rediscover the well-known bound of 61n(n) for the loss of an 
LMSR market maker with n outcomes. 

Let e be a limit on the number of shares that a trader may 
purchase or sell at each time step; in other words, if a trader 
would like to purchase or sell q shares, this purchase must be 
broken down into \q/t \ separate purchases of e or less shares. 
Note that the total number of time steps T needed to execute 
such a sequence of purchases and sales is proportional to 1/e. 



We will construct a sequence of loss functions in a setting 
with n experts to induce a sequence of weight matrices that 
correspond to the price matrices of the LMSR market. At 
each time step t, let pi tt £ [0, 1] be the instantaneous price 
of security i at the end of period t, and let q i)t G [— e, e] be 
the number of shares of security i purchased during period 
t. Let Qi : t be the total number of shares of security i that 
have been purchased up to time t. Now, let's define the 
instantaneous loss of each expert as li.t = (2e — qi.t) /(rib). 
First notice that this loss is always in [0, 1] as long as r\ > 
2e/b. From Equations [2l and 1141 at each time t, 



a Q,,t/b 



a 2et/b-r,C it 



Pi,t 



£" £"e 



2et/b — r)Cj t 



-vCi 



E 



Wi.t 



Applying Theorem [8] and rearranging terms, we find that 

T T n 

max ^ qi.t - Tjpi.tgi.t < y 2 Tb + bln(n). 

»e{i>— ,«} t=1 t=1 i=1 

The first term of the left-hand side is the maximum pay- 
ment that the market maker needs to make, while the second 
terms of the left-hand side captures the total money the mar- 
ket maker has received. The right hand side is clearly min- 
imized when rj is set as small as possible. Setting r\ = 2e/b 
gives us 

T T n 

max 



Since T = 0(1/ e), the term 4e 2 Tb goes to as e becomes 
very small. Thus in the limit as e — > 0, we get the well- 
known result that the worst-case loss of the market maker 
is bounded by 61n(n). 

6.3 Considering Permutations 

Recently Helmbold and Warmuth [16] have shown that many 
results from the standard experts setting can be extended 
to a setting in which, instead of competing with the best 
expert, the goal is to compete with the best permutation 
over n items. Here each permutation suffers a loss at each 
time step, and the goal of the algorithm is to maintain a 
weighting over permutations such that the cumulative regret 
to the best permutation is small. It is infeasible to treat 
each permutation as an expert and run a standard algorithm 
since this would require updating n\ weights at each time 
step. Instead, they show that when the loss has a certain 
structure (in particular, when the loss of a permutation is 
the sum of the losses of each of the n mappings) , an alternate 
algorithm can be used that requires tracking only n 2 weights 
in the form of an n x n doubly stochastic matrix. 



Formally, let W be a doubly stochastic matrix of weights 
maintained by the algorithm A at time t. Here W*j is the 
weight corresponding to the probability associated with item 
i being mapped into position j. Let L G [0, l] nxn be the loss 
matrix at time t. The instantaneous loss of a permutation 
a at time t is l a ,t = ^27=1 ■ The instantaneous loss 

of A is lA,t = Yli=i Y^j=i WljL\ j, the matrix dot product 



between W* and L*. Notice that £^,t is equivalent to the 
expectation over permutations a drawn according to W t of 
l a .t- The goal of the algorithm is to minimize the cumulative 
regret to the best permutation, La.t — mhio-gn L a .T where 
the cumulative loss is defined as before. 

Helmbold and Warmuth go on to present an algorithm called 
PermELearn that updates the weight matrix in two steps. 
First, it creates a temporary matrix W' , such that for every i 
and j, W-j = Wl^^-i . It then obtains by repeat- 

edly rescaling the rows and columns of W' until the matrix is 
doubly stochastic. Alternately rescaling rows and columns 
of a matrix M in this way is known as Sinkhorn balanc- 
ing [22| . Normalizing the rows of a matrix is equivalent to 
pre-multiplying by a diagonal matrix, while normalizing the 
columns is equivalent to post-multiplying by a diagonal ma- 
trix. Sinkhorn [22] shows that this procedure converges to a 
unique doubly stochastic matrix of the form RMC where R 
and C are diagonal matrices if M is a positive matrix. Al- 
though there are cases in which Sinkhorn balancing does not 
converge in finite time, many results show that the number 
of Sinkhorn iterations needed to scale a matrix so that row 
and column sums are 1 ± e is polynomial in 1/e [I] 1171 118] . 

The following theorem [TB] bounds the cumulative loss of 
the PermELearn in terms of the cumulative loss of the best 
permutation. 

Theorem 9. (Helmbold and Warmuth fl6f ) Let A be the 
PermELearn algorithm with parameter rj. After a sequence 
of T trials, 



nln(n) + 7ymin CTe n £ ct ,t 



6.4 Approximating Subset Betting 

Using the PermELearn algorithm, it is simple to approxi- 
mate prices for subset betting in polynomial time. We start 
with a n x n price matrix P 1 in which all entries are 1/n. 
As before, traders may purchase securities of the form 
that pay off $1 if and only if horse or candidate i finishes 
in a position j <E or securities of the form (*5>\j) that pay 
off $1 if and only if a horse or candidate i £ f finishes in 
position j. 

As in Section 16.21 each time a trader purchases or sells q 
shares, the purchase or sale is broken up into \q/e] purchases 
or sales of e shares or less, where e > is a small constant 
Thus we can treat the sequence of purchases as a sequence 
of T purchases of e or less shares, where T = 0(1/ e). Let 
qjj be the number of shares of securities with j £ $ 

or ($f\j) with i £ $ purchased at time t; then q\ j 6 [— e, e] 
for all i and j. 

The price matrix is updated in two steps. First, a temporary 
matrix P' is created where for every i and j, P[ j = P/^e 9 *^ l h 

2 We remark that dividing purchases in this way has the 
negative effect of creating a polynomial time dependence 
on the quantity of shares purchased. However, this is not 
a problem if the quantity of shares bought or sold in each 
trade is bounded to start, which is a reasonable assumption. 
The additional time required is then linear only in 1/e. 



where b > is a parameter playing a similar role to b in 
Equation [2] Next, P' is Sinkhorn balanced to the desired 
precision, yielding an (approximately) doubly stochastic ma- 
trix P t+1 .' 

The following lemma shows that updating the price matrix 
in this way results in a price matrix that is equivalent to the 
weight matrix of PermELearn with particular loss functions. 



Lemma 10. The sequence of price matrices obtained by 
the approximation algorithm for subset betting on a sequence 
of purchases q l 6 [— e, e] nxn is equivalent to the sequence of 
weight matrices obtained by running PermELearn^) on a 
sequence of losses L with 



2e 



li.j 



rjb 



for all i and j, for any r\ > 2e/b. 



Thus we have 
'/ 



T n n 



T—^i ™£ E E «U« - E E E p id,i 



1 - Q-'' 



bnln(n) + 



t=i i=i j=i 
'/ 



1 - e-'> 



1 2enT 



Notice that — e - " 7 ) is an increasing function in n, and 
goes to 1 in the limit as 77 goes to 0. Thus the right hand 
side of this equation decreases as r\ decreases to 0. Setting 
77 = 2e/b yields the result. □ 



Notice that the number of steps T scales inversely with e 
since each lump purchase of q shares must be broken into 
\q/e] individual purchases. Thus in the limit as e — » 0, the 
loss of the market maker is bounded by bnln(?i). 



Proof. First note that for any 77 > 2e/b, L*j £ [0, 1] for 
all t, i, and j, so the loss matrix is valid for PermELearn. 
P 1 and W 1 both contain all entries of 1/n. Assume that 
P* = W l . When updating weights for time t + 1, for all i 
and j, 



P' 



P\^ lb = W\.^ lb = e^ /b W^e-^ /b+q ^ 



/b 



2e/6 



S ' Wi. 



;3 ° "1,3 ■ 

Since the matrix W' is a constant multiple of P' , the 
Sinkhorn balancing step will produce the same matrices. □ 



Using this lemma, we can show that the difference between 
the amount of money that the market maker must distribute 
to traders in the worst case (i.e. when the true outcome 
is the outcome that pays off the most) and the amount of 
money collected by the market is bounded. We will see in 
the corollary below that as e approaches 0, the worst case 
loss of the market maker approaches &nln(n), regardless of 
the number of shares purchased. Unfortunately, if e > 0, 
this bound can grow arbitrarily large. 



Theorem 11. For any sequence of valid subset betting 
purchases q l where qjj £ [— e,e] for all t, i, and j, let 
P 1 , ■ ■ ■ , P T be the price matrices obtained by running the 
subset betting approximation algorithm. Then 



T n n 



™! x E E ffkw -EEE P ' 



jl',3 



t=l i=l 3 = 1 



„ raln(n) + ( „ - 1 ] 

l_ e -2e/i, v > -r y l _ c - 2c / b J 



2enT 



Proof. By Theorem[9]and Lemma ITOl after rearranging 
terms, we have that for any 77 > 2e/b, 

T n n 

2mT -EEE^ 

t = l i=l j = l 



bn In n + 2enT — max Qi,cr(i 



1 - c-'i 



Corollary 12. For any sequence of valid subset betting 
purchases broken into T (= 0(l/e)) small purchases such 
that qlj £ [— e, e] for all t, i, and j, let P 1 , ■ ■ ■ , P T be the 
price matrices obtained by running the Subset Betting Ap- 
proximation Algorithm. Then in the limit as e — > 0, 



™l x E E «»>(<) - E E E p < 

™ i=l i=l t=l i=l j = l 



t t 
i,jQi,3 



< bnln(n) 



This bound is comparable to worst-case loss bounds achieved 
using alternate methods for operating LMSRs on permuta- 
tions. A single LMSR operated on the entire outcome space 
has a guaranteed worst-case loss of 61n(n!), but is, of course, 
intractible to operate. A set of n LMSRs operated as n sep- 
arate markets, one for each position, would also have a total 
worst-case loss bn\n(n), but could not guarantee consistent 
prices. In the limit, our approximation algorithm achieves 
the same worst-case loss guarantee as if we were operating n 
separate markets, but prices remain consistent at all times. 

7. CONCLUSIONS 

We investigate the computational complexity for logarithmic 
market scoring rule (LMSR) market makers to operate com- 
binatorial betting markets. We examine two specific market 
combinatorics, permutations and Boolean combinations. In 
a permutation betting market, the state space is the set of 
rankings of n competing candidates and is of size of n!. In a 
Boolean betting market, the state space is the set of Boolean 
combinations of iV event outcomes and is of size 2^. 

Since allowing participants to bet on individual states is 
both intractable and unnatural, we allow participants to 
trade securities that correspond to simple properties of the 
final state. For permutation betting, we consider subset bet- 
ting, which allows traders to bet on a set of positions that a 
candidate will stand at or a set of traders who will stand at a 
particular position, and pair betting, which allows traders to 
wager on the relative ranking of two candidates. For Boolean 
betting, we consider the situation where traders bet on con- 
junctions or disjunctions of two events. In all cases, we prove 
that it is #P-hard to compute the instantaneous prices as 
well as payments of transactions in a LMSR market. Our 



results on subset betting in LMSR contrast with those of 
Chen et al. [4] who show that subset betting is tractable 
when the market is operated by a central auctioneer who 
performs riskless order matching. Our results on Boolean 
betting contrast with those of Chen, Goel, and Pennock [B] 
who consider betting on single-elimination tournaments — a 
special form of Boolean combinations — and show that such 
a market with LMSR may be operated in polynomial time. 
This raises interesting questions on the connection between 
the complexity of auctioneers and the complexity of mar- 
ket makers, and on the complexity of other combinatorial 
betting scenarios. 

We also show that there is an interesting relationship be- 
tween LMSR market makers and a common class of expert 
learning algorithms. This allows us to design an approxima- 
tion algorithm for pricing subset betting in an LMSR. We 
prove that in the limit the worst-case loss of an LMSR mar- 
ket maker that uses our algorithm remains bounded. In the 
future we plan to further investigate the connection between 
online learning in expert settings and information markets 
with automated market makers. 
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